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Summary 

Background: Chronic fatigue syndrome (CFS) is rela- 
tively common and disabling. Over 8000 patients 
attend adult services each year, yet little is known 
about the outcome of patients attending NHS 
services. 

Aim: Investigate the outcome of patients with CFS 
and what factors predict outcome. 
Design: Longitudinal patient cohort. 
Methods: We used data from six CFS/ME (myalgic 
encephalomyelitis) specialist services to measure 
changes in fatigue (Chalder Fatigue Scale), physical 
function (SF-36), anxiety and depression (Hospital 
Anxiety and Depression Scale) and pain (visual ana- 
logue pain rating scale) between clinical assessment 
and 8-20 months of follow-up. We used multivari- 
ate linear regression to investigate baseline factors 
associated with outcomes at follow-up. 
Results: Baseline data obtained at clinical assess- 
ment were available for 1643 patients, of whom 
834 (51%) had complete follow-up data. There 



were improvements in fatigue [mean difference 
from assessment to outcome: —6.8; 95% confidence 
interval (CI) -7.4 to -6.2; P< 0.001]; physical func- 
tion (4.4; 95% CI 3.0-5.8; P< 0.001), anxiety (-0.6; 
95% CI -0.9 to -0.3; P< 0.001), depression (-1.6; 
95% CI -1.9 to -1.4; P< 0.001) and pain (-5.3; 
95% CI -7.0 to -3.6; P< 0.001). Worse fatigue, 
physical function and pain at clinical assessment 
predicted a worse outcome for fatigue at follow-up. 
Older age, increased pain and physical function at 
assessment were associated with poorer physical 
function at follow-up. 

Conclusions: Patients who attend NHS specialist 
CFS/ME services can expect similar improvements 
in fatigue, anxiety and depression to participants 
receiving cognitive behavioural therapy and 
graded exercise therapy in a recent trial, but are 
likely to experience less improvement in physical 
function. Outcomes were predicted by fatigue, dis- 
ability and pain at assessment. 
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Introduction 

Chronic fatigue syndrome (CFS), also known as 
myalgic encephalomyelitis (ME), is defined as per- 
sistent or recurrent debilitating fatigue that is not 
lifelong, not the result of ongoing exertion, not alle- 
viated by rest, not explained by other conditions and 
results in a substantial reduction in function. 1 " 3 CFS 
is heterogeneous 4 and relatively common, with an 
estimated prevalence from population surveys of be- 
tween 0.2% and 2.6% depending on case definition 
and recruitment methodology. 5-10 Adults with CFS 
can be very disabled 11 and are generally unwell for 
a long time, with a median duration of illness of 6.3 

12 

years. 

Over 8000 adults 13 are assessed and treated by 
specialist UK National Health Service (NHS) CFS/ 
ME clinical teams each year and although we 
have trial evidence about which treatments are ef- 
fective, little is known about what happens when 
patients are treated in the NHS setting or which fac- 
tors are associated with treatment outcomes. One 
previous study showed that outcomes were better 
for patients with CFS enrolled in a trial of cognitive 
behavioural therapy (CBT; N=30) when compared 
with those receiving CBT as part of everyday clinical 
practice. 14 The only other service evaluation we 
found showed that secondary care outpatients who 
received CBT improved their function and symptom 
count, although only 61% gave follow-up data. 15 
There is clear trial evidence that CBT and graded 
exercise therapy (GET), are moderately effective for 
the treatment of CFS. 16,17 In a recent trial, CBT and 
GET were each associated with a mean improve- 
ment of ~7 points on the Chalder Fatigue Scale, 
comparing baseline with 52 weeks follow-up and 
~19 points on the SF-36 physical function 
subscale 17 . 

NHS specialist CFS/ME services in England follow 
National Institute for Health and Clinical Excellence 
(NICE) guidelines, offering CBT or GET, or compo- 
nents of CBT or GET alone or with activity manage- 
ment and sleep management. 3 For severely affected 
patients, the guidelines recommend an activity man- 
agement programme that draws on the principles of 
CBT and GET. 3 

In this study, we use data from the CFS/ME 
National Outcomes Database (NOD) to investigate: 
(i) what the prognosis is for patients accessing NHS 
specialist services; (ii) which routinely collected 
baseline measures predict post-treatment outcomes 
(fatigue, physical function, pain and mood) and (iii) 
whether outcomes are similar to those recorded in 
the Pacing, graded Activity and Cognitive Beha- 
vioural Therapy: a randomised Evaluation (PACE) 
trial for CBT and GET (the most effective treatments). 



Methods 
Study population 

We included patients who had attended one of six 
NHS specialist CFS/ME services during the period 1 
January 2005 to 31 December 2009. The six ser- 
vices were chosen because they had been collecting 
1 2-month outcome data for at least 1 year during the 
study period and had achieved >40% 1 2-month 
follow-up. Patients were included if they were age 
1 8 years or older and were diagnosed with CFS 
according to Centers for Disease Control criteria. 1,2 
Services treated patients with CBT, GET, a combin- 
ation of both or activity management, in group and/ 
or individual treatment sessions of varying numbers 
and lengths. 

Patient-level data 

In each of the six services, patients routinely com- 
pleted the following inventories prior to their initial 
clinical assessment (baseline): 11 -item Chalder 
Fatigue Scale; 18 10-item SF-36 physical function 
subscale; 19 14-item Hospital Anxiety and 
Depression Scale (HADS); 0 and a Visual Analogue 
Pain Rating Scale (score of 0 for 'no pain' and 100 
for 'pain as bad as possible'). The Chalder Fatigue 
Scale was scored using the 0-3 method for scoring 
each question (0 'Less than usual', 1 'No more than 
usual', 2 'More than usual' and 3 'Much more than 
usual'). On the physical function subscale of the 
SF-36 (RAND version), patients scored 0 ('Yes, lim- 
ited a lot') 5 ('Yes, limited a little') or 10 ('No, not 
limited at all') for each question, (range 0-100 with 
0 being most disabled). Inventory total scores (and 
each HADS subscale score) were coded as missing if 
more than one question was unanswered; if only 
one item was missing, an adjusted total score was 
calculated. The same set of inventories was sent to 
patients by post ~12 months after their initial clin- 
ical assessment. Patients were asked at assessment 
to report the duration of their illness (time elapsed, 
in months, between onset of symptoms and clinical 
assessment). Data on treatments received by each 
patient are not currently recorded in the NOD. 

Service-level data 

Information was collected from specialist services 
using a self-completed questionnaire which 
included questions on: numbers of patients assessed 
per annum; total number of staff employed (full-time 
equivalent); type of treatments offered (CBT, GET, 
activity management); number of treatment sessions 
the service usually offered/aimed to offer; ratio of 
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group/individual sessions and estimated average 
contact time with patients. 

Follow-up interval 

Each team sent out follow-up questionnaires at 12 
months. Variation in when the questionnaires were 
sent and delays in return of 12-month follow-up 
questionnaires led to variation in the exact time of 
follow-up. Also, some teams obtained data at add- 
itional follow-up points (e.g. 6 and 24 months). To 
maximize the availability of follow-up data for our 
analysis, we determined a margin of follow-up 
either side of 1 2 months. We did this by fitting frac- 
tional polynomial generalized estimating equation 
(GEE) regression models of fatigue against 
time. 21,22 This method (implemented in Stata as frac- 
poly combined with xtgee) compares a linear GEE 
model with the best-fitting first and second degree 
models, each containing fractional polynomial 
terms (from a pre-defined set of integer, fractional 
and negative powers) for time. The differences in 
deviances between the linear and 1 st -degree model 
and the first and second degree models are tested 
using a chi-squared test and the resulting P-values 
indicate whether the change in outcome over time is 
linear or whether it has a more complex shape. We 
inspected a plot of predicted values of fatigue 
against time from the model with the best fit to de- 
termine an appropriate follow-up interval in which 
observed fatigue scores could be assumed to be rep- 
resentative of the scores predicted at 12 months. In 
patients with more than one follow-up assessment, 
the closest to 12 months was used. 

Statistical analysis 

Our primary outcome measures were fatigue and 
physical function. Our secondary outcome meas- 
ures were anxiety, depression and pain. We used 
chi-squared tests (10-year age group and sex) and 
Kruskal-Wallis tests (duration of illness, fatigue, 
physical function, anxiety, depression and pain) to 
compare the baseline characteristics of patients for 
whom we had/did not have complete follow-up data 
on primary and secondary outcomes. We defined 
clinically useful improvements within each person 
as having a difference of 2 points on the total 
Chalder Fatigue Scale and a difference of 1 1 
points on the SF-36 physical function subscale 
between the baseline and follow-up measurements. 
We chose these cut-offs because they equated to 0.5 
SD of the distribution of the baseline measurements. 
These characteristics were included in a random 
effects linear regression model, with service as a 
unit of random effect, to identify factors 
measured at baseline that were independently 



associated with outcomes measured in the 
follow-up interval. Scores from the different inven- 
tories were re-scaled, for this particular analysis, so 
that the range for each was ~0-1 0 so that a regres- 
sion coefficient of 1 represents a 10% change in the 
score. 

Ethical approval 

The North Somerset and South Bristol Research 
Ethics Committee decided that the collection and 
analysis of CFS/ME patient data were part of service 
evaluation and as such did not require ethical 
review by a NHS Research Ethics Committee or ap- 
proval by NHS Research and Development offices 
(REC reference number 07/Q2006/48). 

Results 

Patients and follow-up 

Complete baseline data were available for 1643 
patients assessed by the six services during the 
study period. Of these patients, 1269 (77.2%) were 
female, the mean age was 39.9 (1 2.6 SD) years, and 
patients had been ill for a median of 36 months 
[interquartile range (IQR) 16-84] before accessing 
a service. Follow-up data on fatigue were available 
for 53.7% (882 of 1643) patients up to 24 months 
after assessment. The median interval between as- 
sessment and follow-up was 375 (IQR: 357-402) 
days. We fitted a fractional polynomial model to 
these data to determine the interval that represented 
a reasonably steady state of fatigue around 12 
months after assessment. The model with the best 
fit incorporated a term for Vtime. Fatigue was rela- 
tively constant between 8 and 20 months (240-600 
days) after assessment (Figure 1). Hence, we used 
this interval to capture the maximum amount of 
follow-up data. 

Of the 1643 patients with complete baseline data, 
834 (50.8%) had complete (fatigue, physical func- 
tion, anxiety, depression and pain) follow-up data 
between 8 and 20 months after assessment. We 
compared patients with and without complete 
follow-up data (Table 1). Patients without follow-up 
data were slightly younger than patients with 
follow-up data [mean age 38.5 (12.4 SD) years vs. 
41.3 (12.7 SD) years, Student's f-test P< 0.001] but 
were similar in all other characteristics. 

At follow-up, there were improvements in fatigue, 
SF-36 physical function, anxiety, depression and 
pain (Table 2). About 74% (620 of 834) of patients 
had a decreased Chalder Fatigue score at follow-up 
and 64% (534 of 834) had improved by >2 points 
(our definition of a clinically useful improvement). In 
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Table 1 Characteristics of CFS patients with and without follow-up data 



Characteristics 


Without follow-up 


With follow-up 


P-value a 




data (N = 809) 


data (N=834) 




Age (years) median (IQR) 


38 (28-47) 


41 (32-51) 


<0.001 


Female N (%) 


617 (76.3) 


652 (78.2) 


0.36 


Duration of illness (months) median (IQR) 


36 (17-96) 


36 (15-84) 


0.06 


Chalder Fatigue (0-33) median (IQR) 


27 (23-30) 


28 (23-30) 


0.22 


SF-36 physical (scale 0-100) median (IQR) 


40 (20-60) 


40 (25-55) 


0.46 


Anxiety (scale 0-21) median (IQR) 


10 (7-14) 


10 (7-13) 


0.30 


Depression (scale 0-21) median (IQR) 


9 (7-12) 


9 (7-12) 


0.92 


VPain (scale 0-100) median (IQR) 


52 (24-70) 


51 (24-69) 


0.52 


a Chi-squared test for categorical variables; Kruskal-Wallis test for continuous varia 


bles 




Table 2 Characteristics of CFS patients at baseline and at follow-up (N=834) 






Characteristics 


Baseline Follow-up 


Mean Difference 


P-value a 




mean (SD) mean (SD) 


(95% CI) 




Chalder Fatigue (0-33) 


26.5 (5.2) 19.7 (8.4) 


-6.8 (-7.4 to -6.2) 


<0.001 


SF-36 physical function (scale 0-100) 


40.6 (22.7) 45.0 (27.2) 


4.4 (3.0 to 5.8) 


<0.001 


HADS anxiety (scale 0-21) 


10.1 (4.6) 9.5 (4.6) 


-0.6 (-0.9 to -0.3) 


<0.001 


HADS depression (scale 0-21) 


9.6 (4.1) 7.9 (4.5) 


-1.6 (-1.9 to -1.4) 


<0.001 


Visual Analogue Pain (scale 0-100) 


47.3 (26.6) 42.0 (28.4) 


-5.3 (-7.0 to -3.6) 


<0.001 



a Student's paired f-test 



contrast, only 50% (416 of 834) of patients had an 
increased SF-36 physical function score at follow-up 
and only 16% (131 of 834) had improved by >22 
points. In total, 14% (120 of 834) had clinically 
useful improvements on both scales. 



Patient-level factors associated with 
treatment outcomes 

There was strong evidence (P< 0.001) that each 
baseline measure (fatigue, physical function, 
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anxiety, depression and pain) was associated with 
fatigue at 8-20 months in models that were only 
adjusted for centre and year of assessment 
(Table 3). In a model that was adjusted for all base- 
line measures, fatigue, physical function and pain, 
but not anxiety or depression scores, were asso- 
ciated with fatigue at 8-20 months. Similarly, older 
age, physical function and pain, were associated 
with physical function at 8-20 months. There was 
little evidence that anxiety and depression were 
associated with physical function at follow-up 
(Table 4). 

In fully adjusted regression models, each baseline 
measure was associated with its repeat measure- 
ment at follow-up (Table 5). Lower physical function 
at baseline was associated with higher levels of de- 
pression and pain at follow-up. Higher levels of pain 
at baseline were associated with higher levels of 
anxiety and depression at follow-up. Female sex 
was associated with lower HADS depression at 
follow-up. Higher levels of pain at follow-up were 
related to higher levels of baseline fatigue, physical 
function and depression (Table 5). 

Duration of illness was inversely associated with 
baseline physical function [mean change in SF-36 
score —0.03; 95% confidence interval (CI) —0.05 to 
—0.01 ; P= 0.01 ] per additional month of illness) and 
associated with baseline pain (mean change in 
visual analogue pain score 0.05; 95% CI 0.02 to 
0.07; P= 0.001 per additional month of illness), 
but was not associated with baseline fatigue or 
mood or with any outcomes at 8-20 months in 
fully adjusted models. 

Service-level factors associated with 
treatment outcomes 

Including services as a categorical variable in multi- 
variable model showed that treatment outcomes 
varied between services. Services reported that 
they treated patients with CBT, GET, a combination 
of both or activity management (Table A1 ). 
Comparing the three services that said they offered 
CBT/GET with the three services that only offered 
activity management, suggested that patients attend- 
ing services offering CBT and GET had less im- 
provement in fatigue at 12 months (fully adjusted 
coefficient 0.57; 95% CI 0.24 to 0.90; 
P= 0.001 corresponding to a difference of ~2 
points on the Chalder Fatigue Scale). However, 
this effect mainly represented a comparison be- 
tween two large services which assessed 372 
(CBT/GET) and 501 (only activity management) pa- 
tients, respectively, in 2010. 



Discussion 

This is the first study to investigate treatment out- 
comes and their predictors in patients treated by 
more than one NHS specialist CFS/ME service. 
Among the patients for whom follow-up data were 
available (51% of patients assessed), there were 
overall improvements in fatigue, physical function, 
anxiety, depression and pain at ~1 2-month 
follow-up. As may be expected, the measurement 
of each of these symptoms at baseline predicted its 
subsequent value when repeated between 8 and 20 
months. Patients who were less physically able at 
assessment had higher levels of fatigue, depression 
and pain at follow-up. Patients who were in more 
pain when assessed at baseline had worse scores for 
all outcomes at follow-up. There was little evidence 
that anxiety or depression was associated with either 
fatigue or physical function at follow-up. The size of 
improvement in fatigue was similar to that achieved 
in the PACE trial after CBT and GET, but this was not 
the case for physical function, where the improve- 
ment was considerably less in clinical services com- 
pared with the PACE trial. 

Strengths and limitations 

This is a large study with follow-up data on more 
than 800 patients with clinically characterized CFS. 
Six clinical services contributed to the outcome 
data, suggesting that the results are generalizable 
to other NHS specialist CFS/ME services which pro- 
vide similar types of treatment. However, as the 
study was based on data from patients accessing 
specialist services, the results may not be general- 
izable to patients who receive similar treatments in 
primary care. The main limitation of this study is its 
relatively low follow-up rate. Compared with pa- 
tients who did not complete follow-up, those who 
did had similar baseline levels of fatigue, physical 
function, pain and mood and similar time-to- 
assessment, but were slightly younger. However, 
we don't know whether patients lost to follow-up 
had better, the same or worse outcomes at 8-20 
months, or how any such differential losses might 
affect results. We used 0.5 SD to define the 
clinically important difference. Although this is 
likely to be close to what patients define as clinically 
important, 23 further work needs to be done to define 
how much change is important to patients and what 
outcomes are relevant. 

Services completed questionnaires about the type 
of treatment offered to patients, but patient-level 
treatment data were not collected by the NOD 
and therefore we cannot be certain what was pro- 
vided. Similarly, services provided estimates of the 
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number of treatment hours received by patients, but 
this was not checked with hospital-level activity 
data. A meta-analysis of CBT and GET for CFS 
found that total treatment hours were one of the 
strongest predictors of outcome. 16 The lack of 
patient-level treatment data and a non-intervention 
group does not allow us to compare treatments or to 
determine whether observed improvements were 
due to the interventions themselves. Although our 
analyses suggested that patients who received activ- 
ity management did better than those who received 
CBT and/or GET, this was based on a comparison 
between three services offering activity management 
only and three services offering CBT/GET only. As 
one service in each group was much larger than the 
others, our result is based on a comparison between 
two services. As treatment content and adherence 
were not assessed, it is not known whether CBT or 
GET conformed to existing protocols or whether ac- 
tivity management provided content consistent with 
CBT or GET principles. There are many other 
service-level factors that could affect outcome be- 
tween two services, such as the number of sessions 
offered by the service, a known predictor of out- 
come, 16 the overall philosophy of the service, indi- 
vidual therapist effects and patient characteristics. 24 
We only have data on depression and anxiety 
from inventories. Patients did not have a psychiatric 
interview. The relationship between depression, 
anxiety and physical function could be explored fur- 
ther using a clinical assessment of depression rather 
than self-report questionnaire scores. 

Comparison with previous literature 

Baseline measures of fatigue and physical function 
in our study were very similar to the PACE trial. The 
mean change in fatigue in our study was 6.8 points 
(95% CI —7.4 to —6.2), which is consistent with 
results from the recently reported PACE trial, 
where the mean change in fatigue was 7.4 points 
in the CBT group. 25 This is in contrast with one pre- 
vious study, which showed that improvement in fa- 
tigue was greater in trial participants. 14 However, in 
our study, the mean improvement on the SF-36 
physical function subscale was much less [4.4 
points (95% CI 3.0 to 5.8)] than in the PACE trial 
where the mean improvement was 1 9.2 points in the 
CBT arm and 21 .0 in the GET arm. This remarkable 
discrepancy suggests that the poor outcome in phys- 
ical function in our study may be due to differences 
in the delivery or content of treatments given in rou- 
tine clinical practice. For example, most patients in 
NHS services appear to be offered five or six ses- 
sions (Table A1), whereas PACE trial participants 



attended 12-14 sessions. This needs urgent 
investigation. 

Treatments in our study also appear to be less ef- 
fective than in other randomized controlled trials 
comparing either CBT or GET with controls. 16,26 . 
Patients enrolled in randomized controlled trials 
can have better results than seen in clinical practice 
because trial clinicians follow a research protocol 
and may have more training, supervision, motiv- 
ation and resources available to them. 14 Trials also 
increase expectation of efficacy and patients may be 
more motivated. 27 Trials usually have selection cri- 
teria exceeding those in routine care (e.g. excluding 
for significant risk of self-harm or co-morbid medical 
conditions and only including those willing and able 
to attend regular appointments and complete re- 
peated questionnaires); it is not known whether 
such factors are associated with treatment 
outcomes. 

Our study showed that older age predicted worse 
outcome at follow-up which is consistent with one 
previous study. 14 There was little evidence that dur- 
ation of illness was associated with any outcomes in 
a fully adjusted model which is consistent with pre- 
vious studies. 16 ' 28 We found that poorer baseline 
physical functioning was associated with higher 
scores on the depression subscale of the Hospital 
Anxiety and Depression Scale at follow-up but not 
vice versa. This is in contrast with two previous 
follow-up studies of a clinical cohort (n = 41) 29 and 
a randomized controlled trial (n = 1 1 4) 28 which sug- 
gested that psychiatric co-morbid diagnoses were 
associated with a poorer outcome 30 . A previous 
cross-sectional study suggested that depression par- 
tially mediated the relation between fatigue and 
physical disability. 31 However, this study used a me- 
diation analysis, which was inappropriate in situ- 
ations where there are unmeasured confounders 
for the effect of both the exposure (fatigue) and the 
intermediate (depression) on the outcome (physical 
function). 32 In chronic fatigue, the relationship be- 
tween depression and fatigue is complex, 33 and the 
relationship with disability is likely to have unmeas- 
ured confounders. This aspect of CFS/ME requires 
further investigation using longitudinal data that 
will allow causality to be determined. 

Conclusions 

Although NHS services are moderately effective in 
improving fatigue in patients with chronic fatigue 
syndrome, they are much less effective in improving 
physical function than similar treatments delivered 
in the PACE trial. This requires urgent investigation 
to determine whether it is due to differences in the 
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delivery or the content of treatments offered by ser- 
vices. Future research also needs to include patient- 
level treatment data to investigate variations in out- 
come that may be related to treatment data. We did 
not find that depression, anxiety or duration of ill- 
ness at assessment predicted outcome. Clinicians 
providing assessments should not assume that 
co-morbid mood disorders or length of illness are 
predictors of outcome. 
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